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Abstract Snake venoms, especially those from the two subfamilies, Crotalinae and 
Viperinae, contained a lot of serine proteases. They were responsible for the hemorrhage, 
shock, or disorder of blood coagulation after envenomation. They acted, by activating, ina- 
ctivating, or other converting effects, on almost all the components of hemostatic and 
fibrinolytic systems. Their sequences were homologous to trypsin—kallikrein serine proteases. Va-. 
tiation of primary sequences out of active center results in the difference of substrate 
specificities and the further difference of biological and pharmacological activities. Because of 
their common and unique properties compared to their physiclogical corresponding factors, 
snake venom proteases are proved to be an excellent model for the study of protease substrate 
discriminating mechanism. Furthermore, they have found an important position both in basic 
research and application of hemostasis and thrombosis in clinic. 
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In evolutionary process. venomous snakes have achieved many proteases in their 
venoms that have an activating or inactivating effect on nearly all the phases of human 
hemostasis (Stocker, 1990: Zhang et al, 1993). A considerable number of venom 
proteases acting on hemostasis have been isolated and characterized (Ouyang er al., 
1992; Zhang er al, 1994a). With recent advances in venom protease researches, accord- 
ing to their biochemical structures and biological functions, here we focus on one of the 
most important pharmacological active constituent groups, namely, the venom serine 
proteases about their structure—function features, evolutionary implication and effects 
on blood coagulation. The amino acid and nucleotide sequence comparison computer 
program used in this paper is referred to Higgins ef af, (1989) based on the method of 
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Wibur eż al. (1983). Protein sequences used in this paper: including venom throm- 
bin-like enzymes (batroxobin from Bothrops atrox venom, flavotrobin from Trime- 
resurus flavoviridis venom. ancrodl and ancrod? from Agkistrodon rhodostoma 
venom, Lmmthr-lik from Lachesis muta muta venom, bilineobin from A. bilineatus 
venom, bothrombin from Bothrops moogni venom), A. contortrix contortrix venom 
protein C activator (ACC-C), Vipera russelli venom factor V activator (RVV-V) and 
Trimeresurus ste jhegeri venom plasminogen activator (TSV-PA), mammalian proteases 
rat trypsin I and kallikrein, bovine thrombin B-chain, human factor K., human tis- 
sue—type plasminogen activator (t-PA) and urokinase (u-PA), were obtained from Pro- 
tein Identification Resource Protein Sequence Database. TSV-TLE. a thrombin-—like 
enzyme from Trimeresurus ste jregeri venom was obtained from Zhang er af.” 


1 Classification of proteases 


Based on a comparison of active site, mechanism of action, and three—dimensional 
structure, four classes are recognized by the International Union of Biochemistry, and 
within these classes, six families of proteases are recognized to date (Neurath, 1984, 
1989). They are Serine proteases I and II, Cysteine proteases. Aspartic proteases, 
Metallo proteases I and TI. Each family has a characteristic set of functional amino 
acid residues arranged in a particular configuration to form the active site. Members of 
each family are believed to have descended from a common ancestor by divergent evolu- 
tion. 

The best characterized and physiologically most versatile protease family is that of 
the mammalian serine proteases (family of Serine protease] ) as exemplified by 
pancreatic trypsin, chymotrypsin, elastase and kallikrein. The hallmark of their active 
sites is the catalytic triad formed by residues Asp102, His57 and Serl95 (chymotrypsin 
numbering throughout the paper, Greer, 1990) (Perona et al., 1995). 


2 Snake venom serine proteases 


The first well-known snake venom serine proteases are thrombin—like enzymes. (for 
a review, see Pirkle et al. 1991). Venom thrombin-like enzymes possess the activity of 
clotting fibrinogen, one of the most common biological functions of their physiological 
homologous thrombin. However, there are many function divergences among throm- 
bin-like enzymes and thrombin. Most of the thrombin—like enzymes studied prefer- 
entially release fibrinogen peptide A (FPA), then accompanied by the slow release of 
small amounts of fibrinogen peptide B (FPB). In case of the thrombin—like enzyme from 
A. halys venom, it first releases FPB and then FPA (Guan ef al., 1988). 

Human protein C is activated in vivo by the proteolytic release of a peptide from 
the aming-terminal end of its heavy chain by a complex of thrombin and throm- 


QZhang Y, Lee W H, Gao R er ai, The amino acid sequence of a thrombin—like enzyme (TSY—TLE) 
from Trimeresurus ste jegert venom. Chinese Biochemical Journal, in press. 
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bomodulin (Eaton et al, 1986). A protein C activator has now been purified to 
homogeneity from A. contortrix contortrix venom (Kisiel ef al, 1987). The molecular 
properties of the venom protein C activator, designated as ACC-C, have been des- 
cribed in detail. ACC-—C is a serine protease. It selectively cleaves the heavy chain of pro 
tein C, forming activated protein C. ACC—C has no fibrinogen—clotting activity. 

A factor V activating enzyme, Russell s viper venom factor V activator (RVV-¥Ẹ ), 
has been purified to homogeneity from the venom of V. russelli ( Kisiel, 1979). 
Thrombin—mediated factor V activation occurs as a result of three cleavages in the fac- 
tor V molecule at Arg709-Ser710. Arg1018-Thr1019, and Argl545—Serl546, but the 
proteolytic activation of factor V by RVV-V involves only a single cleavage in factor 
V that occurs at Argl545—Ser1546 (Kane et al, 1988). Except bovine and human factor 
V, no other protein substrate for RVV—-V has yet been identified, indicating it is a 
highly specific enzyme. 

In all mammalian species so far investigated, two distinct plasminogen activators 
have been identified: utokinase—type and tissue—type plasminogen activators. A specific 
plasminogen activator (TSV—PA) has been detected and cloned from the venom of T. 
ste negeri. It is a typical snake venom serine protease (Zhang et al., 1995a). TSV—PA ac- 
tivates human plasminogen in the same model as that of urokinase, cutting plasminogen 
bond Arg561—Val562 and forming two chain plasmin. TSV-PA is a highly specific 
plasminogen activator. It does not act on factor X, prothrombin, and protein C, nor 
does it show direct fibrinogenolytic activity. 

In snake venoms, kallikrein—like enzymes which release kinin from kininogen have 
been purified and investigated. Their pharmacological effects are hypotension. They are 
serine proteases and their N-terminal sequences are the same as those of other snake ven- 
om serine proteases (Komori ef al, 1988), but their physiological substrate is kini— 
nogen. 

Perhaps fibrinogen is one of the most sensitive substrate for snake venom proteases. 
Due to the large tail and linear molecular structure composed of three chains and with a 
molecular weight of 3.3 10°, fibrinogen can be hydrolyzed by numerous venom 
proteases. Many venom serine proteases, which show neither thrombin-like nor other 
specific activity but which nonspecifically degrade fibrinogen to smal fragments, have 
been purified and characterized (Sugihara et al, 1986; Zhang ef al, 1994b), 

The common features of these venom protease are; all of them show more ort less es- 
ter and amidase activities, but for large protein substrates the specificity varies greatly. 
Furthermore, venom proteases are usually highly specific, like RVV— V it only cuts one 
bond in factor V and TSV-PA only cuts one bond in plasminogen. 


3 Sequence comparison of venom serine proteases 


Fig. 1 shows the sequence alignment of venom serine proteases with their mam— 
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mahan homologous trypsin, It is obvious that these venom serine proteases are only cor- 
responding to a trypsin—like catalytic domain. Usually, they are composed of about 
230-234 amino acid residues. This is around the typical length of other serine protease 
catalytic domain. From alignment of the sequences of snake venom serine proteases, we 
can see that: 1) All of them contained highly conserved regions of serine proteases, in 
which residues His57, Aspl02 and Serl95, which form the catalytic triad of the 
enzyme. are located. 2) With the sequence homology, it was evident that these venom 
proteases were trypsin—like serine proteases. The feature of this group of snake venom 
serine proteases is that their N-terminal sequence is usually V-X-G—G-D-E-C-— 
N-I-N-E-H. 3) Several unusual mutations, which are only found in venom serine 
proteases, occurred in highly conserved regions (structure conserved regions-SCRs, 
Greer, 1990) of serine proteases. Pro28 was replaced by Arg in several venom serine 
proteases, like TSV-PA and ACC-C. Ala56 was replaced by Arg in ACC-C. Position 
Leul55 was occupied by Pro in all venom serine proteases. One of the most striking mu- 
tations occurred in position Gly193. In classical model of serine protease enzymatic 
machanism. this position must be occupied by a Gly. This is the case in all other serine 
proteases except these newly sequenced venom serine proteases. Usually. the common se- 
quence around Serl95 is G-D-S-G-G, but in flavoxobin it becomes F-D-S-G-T. 
Even venom serine proteases keep Gly216 which participates in the form of S1 substrate 
binding pocket (Perona ef af, 1995), mutations also occurred in the amino acid resi- 
dues which are usually Ser214—Trp215 before Gly216. 4) Six disulfide bridges could be as- 
signed from sequence homology with trypsin, located at Cys22-Cysl57, Cys42— 
Cys58. Cys92—Cys245e, Cysl36—-Cys201, Cysl68-Cys182, and Cys191—Cys220, and 
these disulfide bridge assignments were confirmed (Nikai et al, 1995). Among these six 
disulfide bridges, the one that formed by Cys92-Cys245e appeared to be unique to the 
snake venom serine proteases. It presented only in these venom serine proteases. Com- 
pared with mammalian serine proteases. disulfide bridge Cys22—Cys157 does not exist in 
blood coagulation factors like t-PA, urokinase. thrombin and factor IX. Further— 
more, Cys136-Cys201 did not exist in thrombin and factor IX. Obviously, the forming 
of these two disulfide bridges in venom proteases makes the protein folding conformation 
tighter than thrombin. This might be one of the reasons that. unlike thrombin, venom 
serine proteases usually have a narrow substrate specificity. 5} Analysis of the sequences 
indicated that the regions of amino acid residues 59-68, 93-101 and 169-180 were the 
most highly mutable regions in these venom serine proteases. In thrombin the corre- 
sponding region is involved in fibrinogen recognition (Stubbs et al., 1995). The functions 
of these regions in venom proteases are unknown, and they might be important in the 
substrate specificity of venom proteases, 

Fig. 2 shows the sequence similarities of these proteases. As expected, Venom 


proteases exhibite greater homology each other than to mammalian proteases. The simi- 
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larities among venom proteases were 60% —85%. Compared with mammalian physiologi- 


cal proteases, the similarities were around 25% -—40%. The largest sequence similarity 
was with trypsin. The result was in accordance with the gene intron / extron organization 
analysis of batroxobin. which is the same as those of trypsin—kallikrein subfamily (Itoh 
et al, 1988), Although these venom proteases sharing some biological functions with 
thrombin. their similarities with thrombin B-chain were only around 25%. Among ven- 
om proteases, flavoxobin and TSV-TLE were two thrombin-like enzymes from the 
same genus Trimeresurus (T. flavoviridis and T. ste jregeri). They shared a sequence 
similarity of 82%. higher than sequence homology between TSV-TLE and TSV-PA 
(72%), which were from the same snake venom but with different substrate specificity 
(the former clotting fibrinogen, the latter activating plasminogen). Furthermore, batro— 
xobin and ACC-C were highly homologous (70% similarity), but also different in their 
substrate specificity. The former recognized the bonds in fibrinogen to release FPA 
and / or FPB peptides to clot fibrinogen, but the latter selectively cleaved the heavy 
chain of protein C, forming activated protein C. Physiologically, thrombin activates fac- 
tor V, Protein C and clots fibrinogen. Venom proteases share one of the activities of 
thrombin. These make the venom proteases as an excellent model in the study of 
protease substrate discriminating mechanism. which is very useful in bioengineering and 
clinic drug design. 
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Fig. 2 Evolutionary trees of the serine proteases 
The probahle evolutionary relationship of the sequences of serine proteases is 
shown here. The horizontal axis indicates the percent of sequence difference. 


4 Molecular structure modeling of venom serine proteases 


The global conformation of the pancreatic serine proteases is practically the same: 
two compact, similarly folded domains, symmetrically disposed around a two-fold axis 


of symmetry. Differences in substrate specificity can be related to amino acid substitu- 
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tions in the substrate binding site (Neurath, 1989). The modeling of the three dimen- 
sional structure of TSV-PA was performed, based on the crystal structure of trigonal 
bovine pancreas trypsin (protein data base entry code 3P TN) (Zhang et al, 1997). Five 
of the six disulfide bridges of TSV-PA are found in the same positions along the se- 
quence in trypsin. The structurally conserved regions determined through structural align- 
ment are in line with those of Greer (1990). TSV-PA contains an C-terminal amino acid 
extension absent in trypsin and involved in the 92~245e cystine bridge unique to 
TSV~PA. As in all members of the family of serine proteases (Greer, 1990), the 
topology of TSV~PA consists of two subdomains of antiparallel -barrel structure (3 
strands each) and a C-terminal helical segment. The spatial organization of the catalytic 
triad residues His57, Aspl02 and Serl95 at the active site cleft between the two 
subdomains is preserved in TSV-PA. The overall dimensions of TSV-PA are slightly 
larger than those of trypsin, but smaller than those of u-PA. 

As mentioned above, the regions of amino acid residues 59-68 (loop A), 93-101 
(loop B) and 169-180 (loop C) are the most highly mutable regions in venom serine 
proteases. The three loops A. B, C, are exposed to the solvent in the 3D model of 
TSV-PA. TSV-PA, which contains 12 cystein in the form of 6 disulfide bridges, has 
been overexpressed in E. coli (Zhang et al, 1995b). The recombinant TSV-PA had the 
same specific activities as the natural enzyme, when assayed on a variety of substrates in- 
cluding plasminogen. The three regions of TSV-PA, located in the surface loops A, 
B, C around the catalytic site, which differed the most with batroxobin, were replaced 
by corresponding residues of batroxobin. The replacement in TSV-PA of DDE 
(96 a—98) by NVI abolished its plasminogen activation activity without affecting the activ- 
ity on small chromogenic substrates (Zhang et al., 1997). 


5 Evolutionary implication 


The comparison of 57— and 3’~ noncoding regions of cDNAs encoding the venom 
serine proteases; TSV-—PA, batroxobin, and ancrodl from A. rhodostoma venom (Au 
et al, 1993), suprisingly indicates that their 5'— and 3’— untranslated regions are more 
conserved than protein-coding regions, with a degree of identity of about 85% in 
untranslated regions. compared with 60% -—65% in protein coding regions, respectively. 
A similar observation has already been reported in the case of snake venom 
phospholipase A, isoenzymes (Nakashima ef al., 1993). The venom serine proteases 
which have been characterized so far are regarded as a family of proteins which 
specifically interact with different proteins on which they exert their physiological action. 
A phenomenon of accelerated evolution in the protein-coding regions of venom serine 
proteases, allowing an adaptive evolution with diverse physiological activities of pro- 
teases, could therefore be postulated, as has been done in the cases of venom phospho- 


lipase A., the active-site regions of serine protease inhibitors (Hill ez al., 1987) and the 
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major histocompatibility complex gene products (Hughes ez ala 1989). 
6 Clinic study and application 


Because of high specificities and differences in their pharmacological mechanism 
with corresponding blood coagulation factors. venom serine proteases are widely used in 
hemoatasis and thrombosis, both in basic research and in application. For example. 
thrombin time may be prolonged by both the presence of a thrombin inhibitor or a defec- 
tive fibrin polymerization. The batroxobin time is never affected by thrombin inh- 
ibitors, like heparin—antithrombin I complex or hirudin, so Reptilase time (batroxobin 
time) has become a very popular screening test always performed in parallel with 
thrombin (Funk et al, 1971). The finding of a specific protein C activator from snake 
venoms solved the problem of clinic protein C testing, because using physiological acti— 
vator (thrombin) for testing protein C in vivo is real a time consumption process (Stock- 
er, 1990). Compared with t-PA and u-PA, TSV-—PA has a long half-life in vivo 
€ 50-100 times longer). Furthermore. TSV-PA is not inhibited by physiological 
inhibitors, such as plasminogen activator inhibitor I and ÚH. Preliminary in vivo assays 
prove that TSY-PA shows good thrombolytic effects (Wang et al., 1993). 


7 Conclusion 


Snake venoms, especially those from the two subfamilies, Crotalinae and Viperi— 
nae, contain a lot of serine proteases of which the molecular weights are around 2.5 x 
10*-5 x 10°. These proteases were corresponding to a trypsin—like catalytic domain. The 
difference in molecular weight might be explained by carbohydrate content. Because of 
the sequence variation. they have different substrate specificity and furthermore distinc- 
tive biological and pharmacological functions in vivo. Unlike their trypsin homologous, 
they were usually highly specific. for example thrombin-like enzymes, plasminogen 
activator, factor V activator and protein C activator. Up to now, people are still far 
from understanding the interaction between proteases and their natural substrates, espec- 
ially large molecular protein substrates. Because of their common and unique properties 
compared to their physiological corresponding factors, snake venom proteases could be 
used as an excellent model for the study of protease substrate discriminating mechanism. 
Furthermore, they have found a position uniquely in clinic, both in basic research and 
application of hemostasis and thrombosis. 
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